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^ ' Abstract — We address the estimation of quantiles from heavy-tailed dis- 

tributions when functional covariate information is available and in the case 
where the order of the quantile converges to one as the sample size increases. 
Such " extreme" quantiles can be located in the range of the data or near and 
even beyond the boundary of the sample, depending on the convergence rate 
of their order to one. Nonparametric estimators of these functional extreme 
quantiles are introduced, their asymptotic distributions are established and 
their finite sample behavior is investigated. 
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■ 1 Introduction 

An important literature is dedicated to the estimation of extreme quantiles, 
i.e. quantiles of order 1 — a with a tending to zero. The most popular 
^ . estimator was proposed by Weissman [28], in the context of heavy-tailed 

I distributions, and adapted to Weibull-tail distributions in [TUIIH]. We also 

refer to for the general case. 

In a lot of applications, some covariate information is recorded simultane- 
ously with the quantity of interest. For instance, in climatology one may 
be interested in the estimation of return periods associated to extreme rain- 
fall as a function of the geographical location. The extreme quantile thus 
depends on the covariate and is referred in the sequel to as the conditional 
extreme quantile. Parametric models for conditional extremes are proposed 
in [9l [27j whereas semi-parametric methods are considered in [U [22] . Fully 
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non-parametric estimators have been first introduced in [3, where a local 
polynomial modelling of the extreme observations is used. Similarly, spline 
estimators are fitted in [7J through a penalized maximum likelihood method. 
In both cases, the authors focus on univariate covariates and on the finite 
sample properties of the estimators. These results are extended in [2] where 
local polynomials estimators are proposed for multivariate covariates and 
where their asymptotic properties are established. 

Besides, covariates may be curves in many situations coming from ap- 
plied sciences such as chemometrics (see Section for an illustration) or 
astrophysics [3]. However, the estimation of conditional extreme quantiles 
with functional covariates has not been addressed yet. Two statistical fields 
are involved in this study. In the one hand, nonparametric smoothing tech- 
niques adapted to functional data are required in order to deal with the 
covariate. We refer to [6l fTT} [2^ [25] for overviews on this literature. We 
propose here to select the observations to be used in the conditional quan- 
tile estimator by a moving window approach. In the second hand, once this 
selection is achieved, extreme-value methods are used to estimate the con- 
ditional quantile, see [13] for a comprehensive treatment of extreme-value 
methodology in various frameworks. 

Whereas no parametric assumption is made on the functional covari- 
ate, we assume that the conditional distribution is heavy-tailed. This semi- 
parametric assumption amounts to supposing that the conditional survival 
function decreases at a polynomial rate. To estimate the conditional quan- 
tile, we focus on three different situations. In the first one, the convergence 
of a to zero is slow enough so that the quantile is located in the range of 
the data. In the second situation, the quantile is located near the boundary 
of the sample. Finally, in the third situation, the convergence of a to zero 
is sufficiently fast so that the quantile may be beyond the boundary of the 
sample. This situation is clearly the most difficult one since an extrapolation 
outside the range of the sample is needed to achieve the estimation. 

Nonparametric estimators are defined in Section [5] for each situation. 
Their asymptotic distributions are derived in Section [3l Some examples are 
provided in Section U] and an illustration on spectrometric data is given in 
Section [5l Proofs are postponed to Section [6l 

2 Estimators of conditional extreme quantiles 

Let be a (finite or infinite dimensional) metric space associated to a metric 
d. Let us denote by F{.,x) the conditional cumulative distribution function 
of a real random variable Y given x € E and by q{a, x) the associated 
conditional quantile of order 1 — a defined by 

F{q{a, x),x) = 1 — a, 
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for all X £ E and a £ (0, 1). In this paper, we focus on the case where, for 
ah X £ E, F{.,x) is the cumulative distribution function of a heavy-tailed 
distribution. In such a situation, the conditional quantile q{-,x) satisfies, for 
ah A > 0, 



where 7(.) is an unknown positive function of the covariate x referred to as 
the conditional tail index. Loosely speaking, the conditional quantile q{.,x) 
decreases towards at a polynomial rate driven by 7(x). The conditional 
quantile is said to be regularly varying at with index —j{x), and this prop- 
erty characterizes heavy-tailed distributions. We refer to for a general 
account on regular variation theory and to paragraph 14.21 for some examples 
of distributions satisfying ([T]). 

Given a sample (Yi, xi ),..., (y„, x„) of independent observations, our 
aim is to build point- wise estimators of conditional quantiles. More precisely, 
for a given t £ E, we want to estimate q{a,t), focusing on the case where 
the design points xi, . . . , x„ are non random. To this end, for all r > 0, let 
us denote by B{t,r) the ball centered at point t and with radius r defined 



and let hn^t = /it be a positive sequence tending to zero as n goes to infinity. 
The proposed estimator uses a moving window approach since it is based on 
the response variables Y-s for which the associated covariates x'-s belong to 
the ball B{t, ht). The proportion of such design points is thus defined by 



and plays an important role in this study. It describes how the design points 
concentrate in the neighborhood of t when ht goes to zero, similarly to the 
small ball probability does, see for instance the monograph on functional 
data analysis [17j . Thus, the nonrandom number of observations in the slice 
St = (0,oo) X B{t,ht) is given by m„,t = mt = nLp{ht). Let {Zi{t), i = 
1, . . . , mj} be the response variables Y^s for which the associated covariates 
x[s belong to the ball B{t,ht) and let Zi^„it{^) < ••• < ■^mt,mt(i) be the 
corresponding order statistics. 

In this paper, we focus on the estimation of conditional "extreme" quan- 
tile of order 1 — amt- Here, the word "extreme" means that tends to 
zero as n goes to infinity, making kernel based estimators |15j non adapted. 
In the sequel, three situations are considered: 

(5.1) ~^ mtamt oo, 

(5.2) Omt 0, mtamt c G [l,oo) and ["ita-mj ^ [cj- 




(1) 



by 



B{t,r) = {x £ E, d{x,t) < r} 




1=1 
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(S.3) and nitamt — > c G [0, 1), 

where [xj denotes the largest integer smaller than x. Let us highlight that, 
in the unconditional case, situations (S.l) and (S.3) with c ^ have al- 
ready been examined by Dekkers and de Haan [IT] , the extreme case c = 
being considered in [21], Theorem 5.1. A summary of their results can be 
found in [T3], Theorem 6.4.14 and Theorem 6.4.15. In situation (S.l), 
goes to slower than l/rrit and the point-wise estimation of the conditional 
extreme quantile relies on an interpolation inside the sample, since, from 
Proposition [2] below, q{amt,'t) is eventually almost surely smaller that the 
maximal observation Zmt,mt{t) in the slice St- In such a situation, we pro- 
pose to estimate q[amf,t) by: 

9l(amt,i) = -^mi-LmiQ^J + l.mtl*)- (2) 

In the intermediate situation (S.2), estimator ([2]) can still be used, since 
for n large enough, [mtamtl = [cj > and thus the estimation relies on 
a conditional extreme value of the sample. Let us note that, if c is not 
an integer, then nitamt — ^ c implies [mtamtl Otherwise, if c is an 

integer, then condition [mtamt\ — ^ L*^] is necessary to prevent the sequence 
[mtamtl from having two adherence values and qi{amt,t) from oscillating. 
In situation (S.3), goes to at the same speed or faster than 1/mt and 
the conditional extreme quantile is eventually larger than Zmt,mt{t) with 
positive probability e~'^ > e~^. Thus, its estimation is more difficult since 
it requires an estimation outside the sample. We propose in this case to 
estimate q{amt > t) by: 

= ^mt-[mtl3mt\+'i-,mti'^) if^rnt/c>imtV"^*\ (3) 

where /3mt satisfies (S.l) and 7n(i) is a point-wise estimator of the condi- 
tional tail index 7(t). Such estimators have been proposed both in the finite 
dimensional setting [2] and in the general case [20J, see also paragraph 14. 1 1 for 
some examples. Note that ([3]) is an adaptation of Weissman estimator [28] 
in the case where covariate information is available. The extrapolation is 
achieved thanks to the multiplicative term (/3mt/amt)^"^*^ which magnitude 
is driven by the estimated tail index 7n(t). As expected, the extrapolation 
is all the more important as the tail is heavy. 

3 Main results 

We ffi'st give some notations and conditions useful to establish the asymp- 
totic distributions of our estimators. In the sequel, we fix t € E and we 
assume: 
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(A) The conditional quantile function 



a G (0, 1) H- q{a, t) G (0, +oo) 
is differentiable, the function defined by 

a G (0, 1) ^ A(a, t) = -fit) + a^^^fa, t) G (0, +oo) 

oa 

is continuous and such that lim A(a,t) = 0. 

Assumption (A) controls the behavior of the log-quantile function with re- 
spect to its first variable. It is a sufficient condition to obtain the heavy-tail 
property ([T]), see for instance [5j, Chapter 1. For all a G (0, 1), let us intro- 
duce 

A(a,t)= sup |A(a,t)|. 

aG{0,a) 

The largest oscillation of the log-quantile function with respect to its second 
variable is defined for all a G (0, 1/2) as 



ujn{a) = sup 



q{a,x) 

log 



/(a, x') 



, a G (a, 1 — a) , (x, x') G B{t, ht 



Finally, let kt G {1, . . . , m^} and J^^ = {1, . . . ,kt}. Our first result estab- 
lishes a representation in distribution of the largest random variables of the 
sample Zi{t), i G {1, . . . , mt}. 

Proposition 1 If kt/nit and kfiOnim^ '^"^^^^^ — ). for some 5 > 0, then, 
there exists an event An with ¥{An) — > 1 as n oo such that 

{ilogZmt-i+l,mtii e JfcJ \An} = {{\og q{Vi^rnt,Ti) , i G JfcJ \An} , 

where T4,mt < • • • < Vmt,mt o.^^ order statistics associated to the sample 
{Vi, . . . , Vmt} of independent uniform variables and {Ti, . . . ,Tfcj} are ran- 
dom variables in the ball B{t, ht). 

Note that this result is implicitly used in |20j . proof of Theorem 1. We also 
refer to jl4| . Theorem 3.5.2, for the approximation of the nearest neighbors 
distribution using the Hellinger distance and to |18) for the study of their 
asymptotic distribution. Here, condition A:ja;„(m^ _^ q gj-^ows that, 

the smoother the quantile function is on the slice St, i.e. the smaller its 
oscillation is, the easier the control of the uppest observations is, i.e the 
larger kt can be. 

The next proposition is dedicated to the study of the position of the 
conditional extreme quantile q{a, t) with respect to the largest observation 
in the slice St. 
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Proposition 2 If ujn{m^ ') — )■ for some 5 > then 

• under (S.l), ¥{Zmt,mt < Q{oimt,t)) 0, 

• under (S.2) or (S.3), F{Zmt,mt < Qi(^mt,t)) ^ e"". 

Let us first focus on situation (S.l) where the estimation of the conditional 
extreme quantile is addressed using qi{amt,t), an upper order statistic cho- 
sen in the considered slice. 

Theorem 1 Let (amt) be a sequence satisfying (S.l). 
// {mtajntY^nii^t ^^'^^^) — >■ for some (5 > then, 

{mtam^fl' j^M^ _ 1^ 4 Ar(0,7^(i)). 
\q{amt,t) J 

It appears that the estimator is asymptotically Gaussian, with asymptotic 
variance proportional to ^'^ (t) / (mtamt) ■ Thus, the heavier is the tail, the 
larger is 7(t), and the larger is the variance. Besides, the asymptotic vari- 
ance being inversely proportional to ; the estimation remains more stable 
when the extreme quantile is far from the boundary of the sample. Con- 
sidering now sitTiation (S.2), an asymptotically Gaussian behavior cannot 
be expected since, in this case, the estimator is based on the [cjth uppest 
order statistic in the considered slice. 

Theorem 2 Let (amt) be a sequence satisfying (S.2). 
If ujnim^ — >■ for some 5 > then, 

\q{amt,t) J 
where £{c,j{t)) is a non- degenerated distribution. 

The asymptotic distribution £{c,'j{t)) could be explicitly deduced from the 
proof of the result. It is omitted here for the sake of simplicity. Situation 
(S.3) is more complex since the asymptotic distribution of q2 may depend 
both on the behavior of qi and 7^. In the next theorem, two cases arc 
investigated. In situation (i), the asymptotic distribution of q2 is driven by 
qi. At the opposite, in situation (ii), q2 inherits its asymptotic distribution 
from %. 

Theorem 3 Let {(3mt) be a sequence satisfying (S.l) and let {amt) be a se- 
quence eventually smaller than {Pmt)- Define C,rnt = {nT'tPmt)^^'^ ^^^{Pmt / otmt) 
If {mtfimt)'^^n{''nt ^^^^'') — for some (5 > and there exists a positive se- 
quence Vn{t) and a distribution T> such that 

Vn{t){%{t)-^{t))Av, (4) 

then, two situations arise: 
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(i) Under the additional condition 

Cm, max{t;;^^(t),A(/3„,,t)} ^ 0, (5) 

we have 

(a) Otherwise, under the additional condition 

Vn{t) max [C-], A(/?m,, t)} ^ 0, (7) 

we have 

Vn{t) f q2iamt,t) 
log(/3mt/amJ V 9(amt,i) 

Note that, even though the main interest of this result is to tackle the case 
where (amj is a sequence satisfying (S.3), it can also be applied in the more 
general situation where amt is eventually smaller than /?mf For instance, 
it appears that, in situation (S.2), g2(amt>i) is a consistent estimator of 
q{amf,'t) in the sense that the ratio converges to one in probability whereas, 
in view of Theorem [21 ^i(amt,t) is not consistent. Some applications of 
Theorem [3] are provided in the next section. 



V. 



(8) 



4 Examples 

In paragraph 14.11 the above theorem is illustrated with a particular family 
of conditional tail index estimators. The corresponding assumptions are 
simplified in paragraph 14.21 for some classical heavy-tailed distributions. 



4.1 Some conditional tail-index estimators 

In |20) . a family of conditional tail index estimators is introduced. They are 
based on a weighted sum of the log-spacings between the kt largest order 
statistics Zmt-kt+i,mti ■ ■ ■ , Zmt,m.f The family is defined by 



%{t,W) = Y,^\og(^f^^±^^)w{^/Kt) /Y^Wm^t), (9) 

where is a weight function defined on (0, 1) and integrating to one. 

Basing on Q and considering fi^.t = kt/mt, the conditional extreme quan- 
tile estimator ^ can be written as 



hicimtjt^W) = Zmt-kt+l,mt{t) ^ 



kt \ 



7n(i,W) 
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From [20], Theorem 2, under some conditions on the weight function, 7n(i, W) 
is asymptotically Gaussian: 

kl^\Ut, W) - 7(t)) 4 MiO, 7\t)AV{t, W)), 

where AV{t,W) = W'^{s,t)ds. Letting Vn{t) = k^"^ , we obtain 

CmtVn^it) = log ) OO, 

\mtamtJ 

in situation (S.2) or (S.3), which means that condition ([5]) cannot be satis- 
fied. Thus, only situation (ii) of Theorem [3] may arise leading to the following 
corollary. 



Corollary 1 Suppose the assumptions of f20^ . Theorem 2 hold. Let kt ^ oo 
such that 

k]''^h{kt/mt,t) -^Q and (10) 

fcfa;„(m7*'^^^'') — > for some 5 > 0. (11) 

Let (ctrnt) be a sequence satisfying (S.2) or (S.3). Then, 

kl" [ q2{am.,t,W) \ 4^(o,^2(,^^^(,^^)^, 



log{kt/{mtam,)) V q{amt,t) 

As an example, one can use constant weights W^{s,t) = 1 to obtain the 
so-called conditional Hill estimator with AV{t, W^) = 1 or logarithmic 
weights W^{s,t) = — log(s) leading to the conditional Zipf estimator with 
AV{t,W^) = 2. We refer to [20], Section 4, for further details. 



4.2 Illustration on some heavy-tailed distributions 

Standard Pareto distribution is the simplest example of heavy-tailed distri- 
bution. Its conditional quantile of order 1 — a decreases as a power func- 
tion of a since, in this case, q{a,t) = o-tW. Therefore A{a,t) = for all 
a G (0, 1) and condition (jlOp of Corollary [1] vanishes. Another example is 
Frechet distribution for which 



q{a,t) = 




Here, the conditional quantile approximatively decreases as a power function 
of a since, in this case, q{a, t) ~ a'"^^^'' , the quality of this approximation 
being controlled by 

A{a, t) = — ^"(1 + 0(a)) as a ^> 0. 
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A similar example is given by Burr distributions for which 
and 

A(Q,t) = -7(t)Q-^W(l + 0(Q^''W)), 

with < 0. These results are collected in Table [TJ In both Frechet and 
Burr cases, A(a,t) is asymptotically proportional to a"''*-*^ as a — )• with 
the convention p{t) = — 1 for the Frechet distribution. Note that is 
known as the second-order parameter in the extreme- value theory. It drives 
the quality of the approximation of the conditional quantile g(a, €) by the 
power function a~^^^\ Furthermore, it is easily seen, that for these two 
distributions, the function |A(.,f)| is increasing. Thus, condition (jlOp of 
Corollary [T] can be simplified as mf^^^^kl '^^^^^ — )■ which shows that, the 
smaller p{t) is, the larger kt can be. Finally, if 7 and p are Lipschitzian, i.e. 
if there exist constants > and Cp > such that 

|7(x) — 7(x')| < c-yd{x,x') and \p{x) — p{x')\ < Cpd{x,x') 

for all {x,x') £ B{t,ht)'^, then the oscillation can be bounded by u^nifi) = 
0(/it log(l/a)) as a — > and thus condition (jlip of Corollary [1] can be 
simplified as k^ht log mt — >■ 0. 



5 Finite sample behaviour 

In this section, we propose to illustrate the behaviour of our conditional ex- 
treme quantiles estimators on functional spectrometric data. A question of 
interest for the planetologist is the following: Given a spectrum collected by 
the OMEGA instrument onboard the European spacecraft Mars Express in 
orbit around Mars, how to estimate the associated physical properties of the 
ground (grain size of CO2, proportions of water, dust and CO2, etc . . . )? To 
answer this question, a learning dataset can be constructed using radiative 
transfer models. Here, we focus on the CO2 proportion. Given different 
values j/i, i = 1, . . . , 16 of this proportion, a radiative transfer model pro- 
vides us the corresponding spectra Xj, z = 1, . . . , 16 (see Figured]). Clearly, 
the obtained spectra are non random. They are functions of the wavelength 
and we consider here their discretized version on 256 wavelengths Xij. Us- 
ing this learning dataset, a lot of methods can be found in the literature to 
estimate the CO2 proportion associated to an observed spectrum. One can 
mention Support Vector Machine, Sliced Inverse Regression, nearest neigh- 
bor approach, . . . (see for instance |4j for an overview of these approaches). 
For all these methods, the estimation of the CO2 proportion is perturbed 
by a random error term. We propose to modelize this perturbation by: 

Yi,j = log(l/yj) + cr{ej{xi) - T{1 --f{xi))),j = 1, . . . ,ni, i = 1,. . . ,16, 
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where 

|2 

7(xj) = 0.3 — -TTy \ — - — ^ + 0.2, a = min 



x,||2-mm||x,||2 _ log(l/j/.) 

— r — : — - — + 0.2, a = mm — — --. 

max ||x/||2 — min ||x/|[2 « i (i — 7(Xjjj 



and ej{xi), j = 1, . . . , are independent and identically distributed random 
values from a Frechet distribution with tail index 7(xj) (see Tabled]). Note 
that ||2;j||2 is an approximation of the total energy of the spectrum Xi. The 
above definitions ensure that 7(xj) G [0.2,0.5] and that Yi j > for all 
i = 1, . . . , 16 and j = 1, . . . , nj. Furthermore, since the expectation of €j{xi) 
is given by r(l — 7(xj)), the random variables Yi j are centered on the value 
log(l/yj). Our aim is to estimate the conditional quantile 

q{a, Xi) = F^{a, Xj), for i = 1, . . . , 16, 

where F{.,Xi) is the survival distribution function of Yi^i. To this end, the 
estimator q2{a,Xi,W^) defined in paragraph 14.11 is considered. The semi- 
metric distance based on the second derivative is adopted, as advised in [T7| . 
Chapter 9: 

d\x,,x,)= j {xf\t)-xf{t)f dt, 



where x^^-* denotes the second derivative of x. To compute this semi-metric, 
one can use an approximation of the functions Xi and Xj based on B-splines 
as proposed in [T7|, Chapter 3. Here, we limit ourselves to a discretized 
version d of d: 



255 



d {xi,Xj) = 2^ {{xi,i+i - Xjj+i) + - - 2{xij - Xj^i)} . 

1=2 

The finite sample performance of the estimator in assessed on = 100 
replications of the sample {{xi,Yij), i = 1,...,16, j = l,...,nj} with 
ni = . . . = ni6 = 100. Two values of a are considered: 1/300 and 1/500. 
In the following, we assume that the hyperparameters ht and kt does not 
depend on the spectrum (we thus omit the index t). These parameters are 
selected thanks to the heuristics proposed in [20] which consists in minimiz- 
ing the distance between two different estimators of the conditional extreme 
quantile: 

(^select, Select) = arg min A(g2(a, •, W^""), g2 (a, ., VF^), 

h,k 

where for two functions / and g, 

1/2 
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The estimator associated to these parameters is denoted by ^select- We also 
compute /ioracic and ^oracle defined as: 

(/ioracie, ^oracle) = arg min A(g2 («, • , VF") , ^(a, ■)) • 

h,k 

The conditional quantile estimator associated to these parameters is denoted 
by Coracle- Note that /iseiect, A^seiect, /loracie and ^oracle do not depend on a. Of 
course, the oracle method cannot be applied in practical situations where 
q{a, .) is unknown. However, it provides us the lower bound on the dis- 
tance A that can be reached with our estimator. In order to validate our 
choice of /igeiect and /^select, the histograms of A(gsciect(a, •, W^),q{a, .)) and 
A{qora.cie{ct, .,W^),q{a, .)), computed for the N = 100 replications, are su- 
perimposed on FigureO It appears that the mean errors are approximatively 
equal. Let us also remark that the heuristics errors seem to have a heavier 
right-tail than the oracle errors. For each spectrum Xi the empirical 90%- 
confidence interval of (?opt(tt, Xi, W^) is represented on Figure[3]for a = 1/300 
and on Figured] for a = 1/500. The confidence intervals are ranked by as- 
cending order of the tail index. The larger the tail index is, the larger the 
confidence intervals are. This is in adequation with the result presented in 
Corollary [TJ Finally, on Figure [5] (a = 1/300) and Figure [6] (a = 1/500), we 
draw estimators (?sciect(0) Xi, W^) and Coracle (a, Xi, W^) as a function of |[3;j||2 
on the replication giving rise to the median error A{qseiect{ct, ., W^),q{a, .)). 
It appears that the oracle estimator is only slightly better than the heuris- 
tics one. As noticed previously, the estimation error increases with the tail 
index. 

6 Proofs 

6.1 Preliminary results 

Our first auxiliary lemma is a simple unconditioning tool for determining 
the asymptotic distribution of a random variable. 

Lemma 1 Let (X„) and (Yn) be two sequences of real random variables. 
Suppose there exists an event An such that {Xn\An) = {Yn\An) withF{An) 
1. Then, Yn implies Xn Y . 

Proof of Lemma [1] — For all a; € M, the well-known expansion 

<X) = F{{Xn < x}\An)n^n) + IP({Xn < x}|^^)P(^^), 

where A^ is the complementary event associated to An, leads to the follow- 
ing inequalities: 

¥{{Xn < x}\Ar,)F{An) < P(X„ < x) < P({X„ < x} | A)P( A) + P(^?) • 
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bmce {Xn\An) = {Yn\An), it follows that: 

n{Yn <x}n An) < P(X„ < X) < ¥{{Yn < x} D An) + PM^)- 

Taking into account of 

HYn <X)- nA^) < F{{Yn <x}n An) < nYn < x) 

leads to: 

nYn <X)- P(^^) < ¥{Xn <X)< P(y„ <X)+ P(^^). 

The conchision is then straightforward since P(l^ < ^ P(y < x) and 
P(^^) ^0. ■ 

The next lemma provides the asymptotic distribution of extreme quantile 
estimators from an uniform distribution in a situation analogous to (S.l) 
in the unconditional case. 

Lemma 2 Let Vi, . . . ,Vm be independent uniform random variables. For 
any sequence {6m) C (0, 1) such that 9m and MOm — > oo, 

1 /2 

Proof of Lemma [2] — For the sake of simplicity, let us introduce kM = 
[M^AfJ. From Renyi's representation theorem, 

^ kM /M+l 
i=\ I i=l 

where Ei, . . . , Em+i are independent random variables from a standard ex- 
ponential distribution. Thus, 



1/2 



and, in view of the law of large numbers, we have 



[MOm)^'^ 



M 

M 



(1 + op(1)) + (M0m) 



/ ^ M+l \ 
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Let us consider the three terms separately. First, writing kM = MOm — tm 
with Tm € [0, 1), we have 

since MOm oo. Second, since /cm ~ M6m, the central limit theorem 
entails 

Similarly, it is easy to check that 

e3,Af = Op(<') = Op(l), (14) 

since 6m 0. Collecting (fT2|) . (fT3]) and (fn|) concludes the proof. ■ 



6.2 Proofs of main results 

Proof of Proposition [1]— Under (A) and since the random values {Zi{t), i = 
1, . . . , mt} are independent, we have: 

{logZi(t), i = 1, . . .,mt} = {logq{Vi,Xi) i = I, . . . ,mt}, 

where Xi is the covariate associated to Zi{t). Denoting by ^(i) the random 
index of the covariate associated to the observation Zmt-i+i,m.t{'^)^ we obtain 

{log Zrnt-i+l,mtif). i = l,---,mt} = {log ^(^/.(i) , ) i = l,...,mt}. 

Let us consider the event An = Ai^n H ^2,n where 

An = I min log g^^--"-''^-) ^ >Oy{m,...,u,JcB{t,ht)] and 
A2,n = I min log^(^|:!I^^>0,V(n,,+i,...,n„JcB(t,/ii)|. 

li=kt+l,...,mt q[Vi,mt,Ui) J 

Conditionally to Ai^n, the random variables g(Vi^mt) ^ = 1, are 
ordered as 

qiVkt,mt,Ukt) < qiVkt-l,mt,Ukt^i) < ■ ■ ■ < q{Vi^rnt,Ul), 

and, conditionally to A2,n, the remaining random variables q{Vi^mtjUi), i = 
kt + 1, . . . ,mt are smaller since 

max q{Vi^rnt,Ui) < q{Vkt,nit,UkJ. 
i=kt+l,...,mt 
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Thus, conditionally to An, the kt largest random values taken from the 
set {logg(y^(j),x^(j)), i = l,...,mt} are {logq{Vi^rnt,x^{t)), i = 1,---,A;J. 

de f 

Consequently, for J^^ = {1, . . . , fc^} and letting Tj = we have: 

{logZm,_i+i_mj(t),i G Jkt\^n} = {\ogq{Vi^rnf,Ti),i G Jkt\^n} ■ 

To conclude the proof, it remains to show that T{An) — )• 1 as n — > oo. Let 
us define Smt = and consider the events 

A4,n = < mm log--- —>2uJn[0.m^ 

|^i=l,...,fct q[Vi+i^rnt,t) 



Under As^n, we have 5mt < yi,mt < 1 — ^mt for all i = 1, . . . , m^. Hence, for 



all (uj, Uj) G /ij)^, it follows that, on the one hand 



{yi,mt,Ui) q{Vi,mt,t) qiVj^rntit) QiVi^ratjUi) 

- -777 TV ~ 2'^" ) ' 



and on the other hand. 



mm log — — r- > mm log — — - 

i=kt+l,...,mt q[Vi^mt,Ui) i=kt+l,...,mt q[Vi^mt,t) 



Consequently ^3^„ n A^^n C Remarking that 

since Vmt,mt = 1 - l^.m* and P(Vi,mt > (5mt) = (1 - 5m J™* ^ 1, it thus 
remains to prove that ¥{A4^n) 1- From [5], paragraph 1.3.1, condition 
(A) implies that there exists c{t) > 0, depending only on t such that, for all 
a G (0,1), 

"1 7(t) + A(u,t) 



(/(q, t) = c(t) exp I y 



-du > , 



which is the so-called Karamata representation for normalised regularly 
varying functions. Hence, for all i € Jkt, 
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and it follows that 

, Q\^i,mt:t) ^ / /,\ A/TA J.^^l ^i+l,mt 
^°S^T7 TV > (7(0 - MVkt+l,mt,t))log— , 

leading to 

P(An) > f ( (lit) - A{Vk,+l,m„t)) min log^^ > 20Jn{Sm, 
\ 1=1, ...M Vi,mt 

In view of Renyi representation for uniform ordered random variables, 

{ilog{V-^jV-^l,J, i G JkA = {F„ i G JfcJ, 

where Fi, . . . , F^^ are independent random variables from a standard expo- 
nential distribution, we have 

Pi,m, - P _mm - > - li^^P 



i=i,...,kt i -f{t) J fj^ V lit) 

since k'^iOni^^mt) — ^ 0- Furthermore, Vk^-\.\^mt ~ 

{kt/mt){l + op{l)) and 
A(a,f) — > as a — > entail P2,mt ~^ 1- The conclusion follows. ■ 

Proof of Proposition [2] — From Proposition [H there exists an event An 
with ¥{An) 1 such that {Zmt,mt{t)\^n) = {q{yi,mt,Ti)\An) and thus, 

F{Zn^„mAt)<q{arm,t)) = P fjlog ^^^^^^^ < ol n A 



+ pf{log%^<o|nA^') 

=^ -Ps/mt + P4,mt • (15) 

Clearly, Pi^mt ^ P(^n) ~^ 0- Let us now consider the term Ps^mt- Introduc- 
ing 6mt = m^^^^^^ and A^^n = {^i,mt G l^mtA - Smt]}, we have 

P3,m. = pf|log^^^i^i^<o|nAn>l5,n 
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and standard calculations lead to: 

<?(^l,m*,ri) 



P N log 



< ^ n A,n + HAn) -1<P3. 



< 



mt 



5,n) 



Furthermore, As^n implies 



log 



and thus 



log ^i^^hUh:^ < -UJn{6m,)\ n A5,n) + P(A) - 1 < P3,m, 



< 



which entails 



F ( log ^^^^^ < -^n{5m,) ) + P(A,n) + - 2 < P3,m, 



Let us now focus on the quantity 



< 



(16) 



def 



= F log 



log 



g(^l,mf,t) 



exp 



(9(Fi,t)<e±-"(^'"*)g(a^„t)) 
l-Fi <F(e±'^"(^-*)(7(a„„t),t 
mtlogF (e^'^"^^"^t)q(^amt,t),t^ 



Since e^'*'"^''""*)q'(Q;TOj, t) — )■ cxd and introducing the conditional survival func- 
tion F{.,t) = 1 — F{.,t), we have 

mtlogF(e±""('''"*)g(«mni),t) = -mjF (e±"" )(?(a„, , t), t) (1 + o(l)) 



-mtamt - 



F {q{amt,t),t) 



(1 + «(!))■ 
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As already mentioned, (A) implies ([T]) which, in turn, shows that F{.,t) is a 
regularly function at infinity with index —l/^(t). Hence, since e^'^"^'''"'^ — t- 
1, we thus have (see [5], Theorem 1.5.2), 



F{q{amt,t),t) 



1. 



As a conclusion. 



(17) 



P5,m, = [l-amA^ + 0{l))r\ 

and collecting (fT6]l and (fT7|) leads to: 

[1 - am, (1 + o(l))]™* + P(A,n) + P( A) - 2 
< P3,m, < [l-a„,(l + 0(l))]'"*+P(>l^J. 

Since P(^5^„) — )• 1 and P(^n) — )• 1, it is then straightforward that Ps^mt 
under (S.l) and Ps^mt ~^ under (S.2) or (S.3). Equation concludes 
the proof. ■ 

Proof of Theorem [T] — Let us introduce, for the sake of simplicity, kt = 
[nitamtl- From Proposition [U there exists an event An such that: 



/ Nl/2i 9l(«mt,i) 

[mtamt) ^ log— ^ — 



{mtamt)^/'^ loj 



where ¥{An) — 1- From Lemma [H the convergence in distribution 

r ^l/2l (liykt,mtiFkt) d .r, 2 u\\ 

[mtamt) ' log ^ A/(0,7 (t)), 

is a sufficient condition to obtain 



:i8) 



(m,a^jV2logM^AAA(0,7^(t)). 

A straightforward application of the 5-method will then conclude the proof. 
Let us prove the convergence in distribution (jlSp . To this end, consider 



Pn 



log 



9(^A 



kt,mt 5 



and let 5„ 



-{1+5) 



. Remark that, under (S.l), 



P(i?n < UJniSm,)) > P(^fct,m, G [5mtA " ^^mj) ^ 1- 

Thus, i?„ = Op{uJniSmt)) aiid we have 

log IT:^ = ^^S—JZ + Op{u}n{dmtj)- 



(19) 
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Let us introduce the log-quantile function g{.) = logq(.,t). Clearly, for all 
aG(0,l), 

, A(a,t)-7(t) 

9 (a) = 

a 

and a first-order Taylor expansion leads to: 

q{.arnt,t) 

amtJ 



rritli 



where 6lmt G [min(am,, Vfci.mJ, max(amt, V'fcj^mJ]- Now, Ffc, 

,mt ~ c^mt entails 

Qmt ~ c^mt — and, from (A), 

Then, Lemma [2] implies that 

Ka^J^/^log^^^^^ 4 AA(0,7(t)^). (20) 

Collecting ()19p and ()20p concludes the proof after remarking that condition 
{mtamt)'^tJn{Smt) ^ implies (mtamy^'^uJniSmt) ^0. ■ 
Proof of Theorem [2]— Since q{-,t) is regularly varying with index — 7(t), 
we have under (S.2) that q{l/mt,t)/q{amt:t) ~ i'mtCimtV^^^ c'^^*^ and 
the following asymptotic expansion holds 

qi{amt,t) qi{amt,t) q{l/mt,t) 
log -r = log — ;r + log — 

q[amt,t) q{^/'mt,t) q{amt,t) 

= log4j^ + 7(t)log(c)+o(l). 
q{l/mt,t) 

Now, recall that in situation (S.2), for n large enough, [mjamj = L^J- 
Thus, from Proposition [H there exists an event An such that ¥{An) —> 1 
and 

qi{amt,t) 



log 



q{l/mt,t) 



A)^flog-'^^LcJ,r..TLcj; 



q{l/mt,t) 



Mimicking the proof of Theorem [H we obtain 



TVi r& = -itr TT + OpK O^mt )• 

g(l/mf,t) q{l/mt,t) 

To conclude, one can remark that (/(V^^cJ.mj > is the [cjth uppest order 
statistics associated to a heavy-tailed distribution. In such a case. Corol- 
lary 4.2.4 of [13] states that q{V^i.j^„^^,t)/q{l/mt,t) converges to a non- 
degenerated distribution. This asymptotic distribution is explicit even though 
it is not reproduced here. ■ 
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a 



Proof of Theorem [3] — Observing that 

logg2(amt,i) = \og qi{l3mt,t) +%{t)\og 

leads to the foUowing expansion 

, g2(amt,i) , qi{l3mt,t) 

— ^ TV = log —7^ — ;r 

q{amt,t) q{Pmt,t) 

+ log(^){Ut)-l{t)) 



mt 



a 



mt 



l°g -T3 IV + 7(*) log 



First remark that, under (A), as aheady shown in the proof of Proposition [H 

q{arm,t) [^^^ ^{t) + A{u,t) ^ 

and thus, ^6,mt can be simphfied as 

/■^-t A{u,t) , 

^6,mt = / an 



which leads to the bound: 

t^mt 



lee.mj < A(/3™,,t)log 



Or 



The two additional conditions are now treated separately since, under condi- 
tion ([5]), the asymptotic distribution is imposed by C'i,mt whereas, under 
the asymptotic distribution is imposed by Cb^mt- 

(i) Under ([5]), Theorem [1] entails that 

imt(3^y/^U,mA^iO,l\t)) (21) 

and 

(mi/3^jl/2^5,m, = UtV-\t)Vn{mnit) - -f{t)) ^ 0, (22) 

from (g]) and ([5]). Finally, 

(mt/3„ji/2|e6,„J < Cm, A (/?„,, t) ^ 0, (23) 
from (P. Collecting dH]), ([22]) and ([231) concludes the proof of dSD. 

(ii) Under ([7]), Theorem [1] implies 

"""^^^ U,mt = Vn{t)C;n]{mtPmtf'HA,mt ^ 0. (24) 



log(/3mt/a 
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Moreover, from 



log(/3mi/amt 

and finally, 



"""^^^ ^5,™. = vnm%{t) - 7(t)) A P (25) 



"""^^^ l^e.mj < A(/3„„tK(t)^0, (26) 



log(/3mt/amt; 

under Collecting ([25]) and ([26]) concludes the proof of ([8]). ■ 
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Figure 1: Representation of the 16 spectra as functions of the wavelength. 



23 




I I I I I 

Oe+00 1e-05 2e-05 3e-05 4e-05 



Figure 2: Comparison between the error distributions obtained with the 
heuristics method (transparent) and the oracle method (gray) on = 100 
samples. 
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Figure 5: Comparison of the true quantile of order a = 1/300 (solid line) 
with the estimated quantiles by the heuristics strategy (dashed line) and the 
oracle strategy (dotted hne) on the rephcation corresponding to the median 
error. The associated sample is represented by the points ("x"). 
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Figure 6: Comparison of the true quantile of order a = 1/500 (solid line) 
with the estimated quantiles by the heuristics strategy (dashed line) and the 
oracle strategy (dotted hne) on the rephcation corresponding to the median 
error. The associated sample is represented by the points ("x"). 
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Pareto 



Frechet 



Burr 



q{a,t) 



a 



-lit) 



a ■^(*) <! - log ( ^— 
a V 1 — a 



-7(<) 



a 



-lit) (^i-a-^'W) 



-7(i)/p(i) 



A{a,t) 



lit) 



a(l + 0(Q)) 



-7(t)a-''W(l + 0(a-''W)) 



Table 1: Some examples of heavy-tailed distributions. For all distributions, 
7(t) > is the tail-index and p{t) < is referred to as the second-order 
parameter in extreme-value theory. 
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